OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

A New Large Urdu Database for Off-Line Handwriting Recognition

Internal identifier: 000A30 (Main/Exploration); previous: 000A29; next: 000A31

Authors: Waqas Sagheer [Canada]; Lei He [Canada]; Nicola Nobile [Canada]; Y. Suen [Canada]

Source :

RBID : ISTEX:AFD8623B176383EBF861535E742EA064CF9FFAA0

Abstract

A new large Urdu handwriting database, which includes isolated digits, numeral strings with and without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly finance-related), and Urdu dates in different patterns, was designed at the Centre for Pattern Recognition and Machine Intelligence (CENPARMI). It is the first database for Urdu off-line handwriting recognition. It involves a large number of native Urdu speakers from different regions of the world. Moreover, the database is available in different formats: true color, gray level and binary. Experiments on Urdu digit recognition have been conducted, with an accuracy of 98.61%. Methodologies for image pre-processing, gradient feature extraction and classification using SVM are described, and a detailed error analysis of the recognition results is presented.

Url:
DOI: 10.1007/978-3-642-04146-4_58


Affiliations:




The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A New Large Urdu Database for Off-Line Handwriting Recognition</title>
<author>
<name sortKey="Sagheer, Waqas" sort="Sagheer, Waqas" uniqKey="Sagheer W" first="Waqas" last="Sagheer">Waqas Sagheer</name>
</author>
<author>
<name sortKey="He, Lei" sort="He, Lei" uniqKey="He L" first="Lei" last="He">Lei He</name>
</author>
<author>
<name sortKey="Nobile, Nicola" sort="Nobile, Nicola" uniqKey="Nobile N" first="Nicola" last="Nobile">Nicola Nobile</name>
</author>
<author>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:AFD8623B176383EBF861535E742EA064CF9FFAA0</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-04146-4_58</idno>
<idno type="url">https://api.istex.fr/document/AFD8623B176383EBF861535E742EA064CF9FFAA0/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001128</idno>
<idno type="wicri:Area/Istex/Curation">001073</idno>
<idno type="wicri:Area/Istex/Checkpoint">000552</idno>
<idno type="wicri:doubleKey">0302-9743:2009:Sagheer W:a:new:large</idno>
<idno type="wicri:Area/Main/Merge">000A38</idno>
<idno type="wicri:Area/Main/Curation">000A30</idno>
<idno type="wicri:Area/Main/Exploration">000A30</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A New Large Urdu Database for Off-Line Handwriting Recognition</title>
<author>
<name sortKey="Sagheer, Waqas" sort="Sagheer, Waqas" uniqKey="Sagheer W" first="Waqas" last="Sagheer">Waqas Sagheer</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>CENPARMI (Centre for Pattern Recognition and Machine Intelligence) Computer Science and Software Engineering Department, Concordia University, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="He, Lei" sort="He, Lei" uniqKey="He L" first="Lei" last="He">Lei He</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>CENPARMI (Centre for Pattern Recognition and Machine Intelligence) Computer Science and Software Engineering Department, Concordia University, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Nobile, Nicola" sort="Nobile, Nicola" uniqKey="Nobile N" first="Nicola" last="Nobile">Nicola Nobile</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>CENPARMI (Centre for Pattern Recognition and Machine Intelligence) Computer Science and Software Engineering Department, Concordia University, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>CENPARMI (Centre for Pattern Recognition and Machine Intelligence) Computer Science and Software Engineering Department, Concordia University, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">AFD8623B176383EBF861535E742EA064CF9FFAA0</idno>
<idno type="DOI">10.1007/978-3-642-04146-4_58</idno>
<idno type="ChapterID">58</idno>
<idno type="ChapterID">Chap58</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: A new large Urdu handwriting database, which includes isolated digits, numeral strings with and without decimal points, five special symbols, 44 isolated characters, 57 Urdu words (mostly finance-related), and Urdu dates in different patterns, was designed at the Centre for Pattern Recognition and Machine Intelligence (CENPARMI). It is the first database for Urdu off-line handwriting recognition. It involves a large number of native Urdu speakers from different regions of the world. Moreover, the database is available in different formats: true color, gray level and binary. Experiments on Urdu digit recognition have been conducted, with an accuracy of 98.61%. Methodologies for image pre-processing, gradient feature extraction and classification using SVM are described, and a detailed error analysis of the recognition results is presented.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Canada</li>
</country>
</list>
<tree>
<country name="Canada">
<noRegion>
<name sortKey="Sagheer, Waqas" sort="Sagheer, Waqas" uniqKey="Sagheer W" first="Waqas" last="Sagheer">Waqas Sagheer</name>
</noRegion>
<name sortKey="He, Lei" sort="He, Lei" uniqKey="He L" first="Lei" last="He">Lei He</name>
<name sortKey="He, Lei" sort="He, Lei" uniqKey="He L" first="Lei" last="He">Lei He</name>
<name sortKey="Nobile, Nicola" sort="Nobile, Nicola" uniqKey="Nobile N" first="Nicola" last="Nobile">Nicola Nobile</name>
<name sortKey="Nobile, Nicola" sort="Nobile, Nicola" uniqKey="Nobile N" first="Nicola" last="Nobile">Nicola Nobile</name>
<name sortKey="Sagheer, Waqas" sort="Sagheer, Waqas" uniqKey="Sagheer W" first="Waqas" last="Sagheer">Waqas Sagheer</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</country>
</tree>
</affiliations>
</record>
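
The record above can also be read outside Dilib. The following is a minimal sketch, not part of the Wicri tooling, of extracting the title, authors, DOI and year with Python's standard `xml.etree.ElementTree`. The record uses the `wicri:` prefix without declaring it, so the sketch binds a placeholder namespace URI (`urn:example:wicri`, an assumption made only so the XML parses). The helper names `parse_record` and `summarize` and the trimmed `SAMPLE` string are likewise illustrative.

```python
import xml.etree.ElementTree as ET

# Assumption: placeholder URI. The page does not declare the "wicri" prefix,
# so a dummy namespace is bound here purely to make the record parseable.
WICRI_NS = "urn:example:wicri"

# Trimmed copy of the record shown above (two of the four authors kept).
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A New Large Urdu Database for Off-Line Handwriting Recognition</title>
<author>
<name sortKey="Sagheer, Waqas">Waqas Sagheer</name>
</author>
<author>
<name sortKey="He, Lei">Lei He</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="RBID">ISTEX:AFD8623B176383EBF861535E742EA064CF9FFAA0</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-04146-4_58</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def parse_record(xml_text):
    """Bind the undeclared wicri prefix on the root element, then parse."""
    patched = xml_text.replace(
        "<record>", '<record xmlns:wicri="{}">'.format(WICRI_NS), 1
    )
    return ET.fromstring(patched)


def summarize(record):
    """Collect a few bibliographic fields from the TEI header."""
    file_desc = record.find("./TEI/teiHeader/fileDesc")
    authors = [n.text for n in file_desc.findall("./titleStmt/author/name")]
    return {
        "title": file_desc.findtext("./titleStmt/title"),
        "authors": authors,
        "doi": file_desc.findtext("./publicationStmt/idno[@type='doi']"),
        "year": file_desc.find("./publicationStmt/date").get("when"),
    }


if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

The same two functions should work on the full record text, provided it begins with a bare `<record>` tag as shown above.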

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A30 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000A30 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:AFD8623B176383EBF861535E742EA064CF9FFAA0
   |texte=   A New Large Urdu Database for Off-Line Handwriting Recognition
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024